Comparative performance study of several features for voiced/ non-voiced classification
نویسندگان
چکیده
This paper presents a comparative performance study of several time domain features for voiced/non-voiced classification of speech. Five classification schemes have been developed by combining one or two features amongst: Energy (E), Zeros Crossing Rate (ZCR), Autocorrelation Function (ACF), Average Magnitude Difference Function (AMDF), Weighted ACF (WACF), and the Discrete Wavelet Transform (DWT). The development of these classifiers was based on the selection of the lowest number of time domain features which allow voicing decision without the need of any frequency transformation or pre processing approaches. The performance of the classifiers has been evaluated on speech data extracted from the TIMIT database. Two different noise types: White and babble, taken from the NOISEX92 database have been incorporated to validate the developed classification schemes in noisy environments. An overall ranking of these classifiers for high and low Signal to Noise Ratios (SNRs) have been established based on the average value of the Percentage of classification accuracy (Pc).
منابع مشابه
Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...
متن کاملAcoustic features for detection of phonemic aspiration in voiced plosives
Plosives in Indo-Aryan languages such as Hindi and Marathi display a 4-way contrast involving the two dimensions of voicing and aspiration. While many studies are available on the acoustics of aspiration in unvoiced stops due to their more universal presence in the world’s languages, voiced aspirated plosives have been less studied. Rather than the release duration cue of aspiration in unvoiced...
متن کاملVoiced/Unvoiced and Silent Classification Using HMM Classifier based on Wavelet Packets BTE features
Wavelet Packets Best Tree Encoded (BTE) features is used here as base features for HMM classifier. The research aimed to introduce the newly designed features that are discussed in [1]. The considered problem is Voiced, Unvoiced and Silent classification. Comparison to the 19 filter banks features is provided. Although it is simple and straight forward, BTE makes comparable results to the 19 el...
متن کاملA Robust Voiced/Unvoiced Phoneme Classification from Whispered Speech Using the 'Color' of Whispered Phonemes and Deep Neural Network
In this work, we propose a robust method to perform framelevel classification of voiced (V) and unvoiced (UV) phonemes from whispered speech, a challenging task due to its voiceless and noise-like nature. We hypothesize that a whispered speech spectrum can be represented as a linear combination of a set of colored noise spectra. A five-dimensional (5D) feature is computed by employing non-negat...
متن کاملStructure-based Speech Classifcation Using Non-linear Embedding Techniques
Usable speech” is referred to as those portions of corrupted speech which can be used in determining a reasonable amount of distinguishing features of the speaker. It has previously been shown that the use of only voiced segments of speech improves the usable speech detection system, and also, that unvoiced speech does not contributes significant information about the speaker(s) for speaker ide...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Int. Arab J. Inf. Technol.
دوره 11 شماره
صفحات -
تاریخ انتشار 2014